ASTROPHYSICAL JOURNAL 699 (2009) 1252-1260. 
Preprint typeset using K^'T^]X style emulateapj v. 04/20/08 



THE SIZE, SHAPE AND ORIENTATION OF COSMOLOGICAL VOIDS IN THE SLOAN DIGITAL SKY 

SURVEY 

Caroline Foster^'^ and Lorne A. Nelson^ 

^ Physics Department, Bishop's University, 2600 College Street, Sherbrooke, Quebec Canada JIM 1Z7 
^Centre for Astrophysics & Supercomputing, Swinburne University, Hawthorn, VIC 3122, Australia 
Astrophysical Journal 699 (2009) 1252-1260. 

Abstract 

We present a detailed description of our void finding algorithm which is an extension of the pre- 
scription by Hoyle and Vogeley (2002). We include a discussion of the reproducibility and robustness 
of the algorithm as well as the statistical significance of the detected voids. We apply our void finder 
to the Data Release 5 (DR5) of the Sloan Digital Sky Survey (SDSS) and identify 232 cosmological 
voids. A void catalog which contains the most salient properties of the detected voids is created. We 
present a statistical analysis of the distribution of the size, shape and orientation of our identified 
cosmological voids. We also investigate possible trends with redshift for 0.04 < z < 0.16. We compare 
our results to those from an identical analysis of a mock catalog based on the ACDM model and find 
reasonable agreement. However, some statistically significant differences in the overall orientation of 
cosmological voids are present and will have to be reconciled by further refinement of the simulations. 
Subject headings: cosmology: large-scale structure of Universe - methods: data analysis - SDSS 



1. INTRODUCTION 

In the 1970s and the 1980s, large-scale redshift surveys 
have revealed the bubbly structures in our universe on a 
megaparsec (Mpc) distance scale. Although some degree 
of clustering was expected, the discovery of large-scale 
structures was a surprise (Fairall 1998). Cosmological 
voids are large regions that are nearly devoid of lumi- 
nous matter and are surrounded by walls, filaments and 
clusters of galaxies. As was first proposed by Zel'dovich 
(1970), they are now best explained as the result of grav- 
itational instabilities in the early universe. Because of 
their enormous size, cosmological voids must have formed 
early in the history of the universe and evolved to their 
present state as expanding low-density regions as pre- 
dicted by the cold dark matter models with cosmological 
constant (ACDM). 

Significant improvements in computational power have 
made it possible to model the formation and evolution 
of large-scale structures through N-body simulations (an 
example being the Millennium Simulation, see Springel 
et al. 2005). Studies of the distribution of sizes of voids 
in the observed galaxy distribution have been performed 
(see for example, Plionis & Basilakos 2002; Muller et al. 
2000; Muller & Maulbetsch 2004). An analysis of void 
properties with respect to CDM models and the effects 
of galaxy/dark matter biasing has been carried out by 
several authors (e.g., Mathis & White 2002; Benson et al. 
2003; Tikhonov & Klypin 2009; Tinker & Conroy 2009). 
Indirect evidence of the size evolution of cosmological 
voids has been provided by Conroy et al. (2005) who 
find that the observed void probability function^ at low 
redshift {z « 0) as measured in the Sloan Digital Sky 
Survey (SDSS) and high redshift {z ~ 1) as measured 
in the DEEP2 Redshift Survey is consistent with that 
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^ The void probability function (VPF) is a measure of the prob- 
ability that a sphere of a given size is devoid of galaxies in a given 
survey. 



expected within the ACDM framework. 

With the increasing amount of observational data now 
becoming available, it is important to investigate the dis- 
tribution of the properties of cosmological voids. Indeed, 
the fifth data release of SDSS (Adelman-McCarthy et al. 
2007) contains sufficient data to be able to select sta- 
tistically homogeneous galaxy samples out to a redshift 
of about 0.16. It is thus possible to look at the overall 
distribution of sizes, shapes and orientations of voids. 

Moreover, Ryden & Mellot (1996) proposed a method 
of verifying the evolution of voids by studying redshift- 
space distortions via the shape and orientation of cos- 
mological voids in redshift space. The redshift is a mea- 
sure of the speed of recession of an object with respect 
to the line-of-sight. For this reason, peculiar velocities 
(e.g., random velocity dispersions in virialized structures 
and coherent infall), large-scale cosmological distortion 
as well as the expansion of the universe all have an im- 
pact on the measured redshift. Unfortunately, these fac- 
tors are hard to deconvolve. 

The basic idea is the following: If a hypothetical spheri- 
cal void with negligible galaxy velocity dispersion around 
its boundary is expanding in real space axisymmetrically, 
it will appear as an elongated ellipse with its semi-major 
axis aligned with the line-of-sight in redshift space. On 
the other hand, if such an idealized void is collapsing in 
real space, its semi-major axis will intersect the line-of- 
sight at 90° in redshift space. Thus, under the assump- 
tions that 1) the intrinsic shape of the majority of voids 
is on average spherical in real space, and 2) the galaxies 
on the boundary of voids have negligible velocity dis- 
persion, then if most voids in redshift space are found 
to intersect the line-of-sight at angles smaller than the 
average expected from a random distribution in three di- 
mensions (i.e. {(p) < 57.3°), it would indicate that voids 
are expanding in real space on average. Otherwise, if 
cosmological voids in astronomical data are found to in- 
tersect the line-of-sight at angles {(j)) > 57.3°, it would 
suggest that most voids are collapsing in real space. Of 
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course, the effects of other redshift-space distortions on 
the distribution of cosmological voids such as 'fingers of 
God' and the more subtle Kaiser effect (Kaiser 1987) 
caused by the coherent infall towards/away from over- 
densities/under-densities can play an important role. For 
example, fingers of God stretch inside the voids thereby 
reducing their size, and changing their shape as well as 
their orientation. As shown by Rydcn & Mcllot (1996), 
the net redshift-space distortions of cosmological voids 
(i.e. elongation or compression along the line-of-sight) 
in cosmological simulations depends on the initial condi- 
tions and cosmological parameters (e.g. the slope of the 
power-spectrum and Qioi)- This suggests that a compar- 
ison of the shape and orientation of cosmological voids in 
observational data with the predictions of semi-analytic 
models could be a useful test of the cosmological and 
semi-analytic models' input parameters. 

In order to do this, one must find an objective and 
accurate way of identifying and quantifying the proper- 
ties of voids. Several void finding algorithms have been 
created in order to fulfill these objectives. There are a 
wide variety of definitions and techniques that have been 
employed by various groups. Only recently has a seri- 
ous attempt been made to compare and contrast these 
different algorithms (see Colberg et al. 2008, and refer- 
ences therein for an overview of the different void finding 
algorithms in use, including the present). We examined 
several algorithms and selected the one that we felt was 
best suited to a robust analysis of the distribution of 
sizes, shapes and orientation of cosmological voids. We 
adopted an algorithm that objectively identifies voids in 
the observed galaxy distribution in a way that best repro- 
duces the results of a visual inspection. Our algorithm is 
an adaptation and extension of the algorithm by Hoyle 
& Vogeley (2002, hereafter HfcV) which has been exten- 
sively employed to identify voids and study their proper- 
ties as well as those of their constituent galaxies in sev- 
eral redshift surveys and structure formation simulations 
(Hoyle & Vogeley 2004; Hoyle et al. 2005). 

This paper is divided as follows: §n describes our void 
finding algorithm, while §111, §IV and §V treat the repro- 
ducibility of our results, robustness of the algorithm and 
statistical significance of voids, respectively. §VI gives 
a brief description our Sloan Digital Sky Survey sample 
and §Vn presents the properties of cosmological voids in 
our sample and a comparison to a mock catalog. Our 
conclusions are given in §Vni. 

2. VOID FINDING ALGORITHM 

The void finding algorithm is based on the prescription 

of Hoyle & Vogeley (sec H&V, 2004; Hoyle ct al. 2005). 
It has been thoroughly tested on the Updated Zwicky 
Catalog (UZC) and the Point Source Catalog Redshift 
survey (PSCz). The algorithm is subdivided into seven 
distinct steps: (i) data input; (ii) classification of galaxies 
as field or wall galaxies; (iii) detection of the empty cells 
in the distribution of wall galaxies; (iv) growth of the 
maximal sphere; (v) classification of the unique voids; 
(vi) enhancement of the void volume; and (vii) calcula- 
tion of the void properties. Below is a detailed descrip- 
tion of each step. 

2.1. Data Input 



Once the dataset has been selected, equatorial coordi- 
nates are converted to comoving Cartesian coordinates 
{Xc, Yc, Zc). This is carried out using the following for- 
mulae: 

^c = l^csin(^(90°-.))cos(^) (1) 
n = ^csin(^(90°-.))sin(^) (2) 

Ze = DeCOs(^(90°-5)) (3) 

where a is the right ascension and S is the declination 

(both in degrees). Also note that Dc is the comoving 
distance in megaparsecs (Mpc) given by the following 
formula (Hogg 2000): 



" Ho Jo ^n^{l + z')^ + Qfc(l + z')^ + Qa ' 

where z is the redshift. We have adopted the following 
values for each of the parameters: Hq = 100/;,, ilm = 
0.28, Qk = 0, and = l-^m-^k = 0.72. These values 
were chosen based on the results of WMAP (Spergel et 
al. 2007). 

2.2. Classification of galaxies as field or wall galaxies 

From the data, the average distance to the third near- 
est galaxy {D3) as well as the standard deviation (0-3) of 
that value are computed. The parameter Rs is defined 
such that 

R3 = D3 + Xa3, (5) 

where A is an adjustable dimensionless parameter that 
together with the threshold parameter (^) most strongly 
affects the inferred number and sizes of voids. The 
threshold parameter is simply defined as the minimum al- 
lowed radius of a void. After a thorough analysis of the 
robustness of the algorithm using the Updated Zwicky 
Catalog, we chose values of A = 2.0, and ^ = 12/i~^Mpc. 
These values of A and ^ yield the smallest variation in the 
number of voids found with respect to the other param- 
eters (see Section 4). Moreover, relatively large values 
of ^ guarantee that all voids that are identified by the 
algorithm are legitimate cosmological voids as opposed 
to spurious voids. 

Wall galaxies are then defined as those whose third 
nearest neighbor is closer than R^. All other galaxies 
are field galaxies, and are the only ones allowed to exist 
inside a void. 

2.3. Detection of the empty cells in the distribution of 

wall galaxies 

The wall galaxies are placed in a grid whose basic cell 
is cubical and has a side of length Rs/2. Dividing the 
cell size by two yields 2^ times more holes than those 
used by Hoyle and Vogeley, and thus it allows us to 
sample the volume more precisely. The entire list of 
wall galaxies is sorted and the cells that contain no wall 
galaxies are identified. The center of each empty cell 
Ci = {Xhoiei, Yhoiei, Zhoiei) IS recorded for later use. 
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2.4. Growth of the maximal sphere 

The position Gi = (^ga/i; ^ga/ii ^gaa) of the nearest 
galaxy to the center of every empty ceh is recorded. An 
empty sphere is defined as having the center of the empty 
cell as its center and a radius constrained by the nearest 
galaxy located on its surface. 

A first growth vector, vt, pointing from the nearest 
galaxy to the center of the empty sphere is then com- 
puted and the radius is effectively increased in the direc- 
tion of vt until another galaxy is found to intersect the 
surface. The algorithm goes through the entire list of 
galaxies and finds the galaxy G2 = {Xgan.Ygan, Zgan) 
which yields the smallest sphere whose center C2 = 
{Xhoie2-,Yhoie2-, Zhoie2) has movcd along vt such that Gi 
and G2 both lie on the surface of the sphere. 

A second growth vector, t^, is defined from the mid- 
point between Gi and G2 to the second center C2. In 
a similar fashion to the first growth, the radius is effec- 
tively increased by moving the center in the direction of 
V2 until a third galaxy intersects the surface. The cen- 
ter C3 = {X holes, Yhoies, Zhoies) of the fuUy grown sphere 
together with its final radius constitute a hole. 

The algorithm then checks that each hole lies within 
the boundaries of the survey. Because legitimate holes 
(and voids) may exist near the edge and reach beyond 
the survey boundaries, we allow for part of the hole to 
exist outside the boundary by extending it by 5-10 Mpc. 
Should the boundaries of the survey extend into regions 
where extinction due to the disk of the Milky Way is 
significant, then the holes along that line-of-sight are re- 
moved. The effect of extending the survey boundary (Ex- 
tension) and of varying the range of galactic latitudes 
extinct by the disk of the Milky Way (MW width) are 
discussed in Section [H 

2.5. Classification of the unique voids 

The holes are ordered from the largest to the small- 
est. If the radius of the first hole is larger than ^, it is 
identified as void number one. The program then goes 
through the entire list of holes whose radii exceed ^. If a 
hole overlaps any one of the constituent holes of at least 
two voids by more than fj^ = 2% of its volume, it is re- 
jected; if it overlaps a previously defined void by at least 
Pi = 10%, it is merged with that void. Otherwise, if it 
does not overlap any previously defined void, it is a new 
void. 

2.6. Enhancement of the void volume 

The holes that have radii smaller than ^ are considered 
next. In a similar manner as for the large holes, if a small 
hole overlaps constituent large holes of at least two voids 
by more than /^a, it is rejected. Otherwise, if it overlaps 
one and only one large hole by at least [32 = 50%, it is 
merged with the corresponding void. Expansion of the 
volumes of the voids by this method allows for the growth 
of aspherical voids. 

2.7. Calculation of the void properties 

Once the constituent holes of each void have been iden- 
tified, we compute the final position, volume and equiv- 
alent spherical radius of each void using Monte Carlo 
methods. 



2.7.1. Determination of Shapes 

Quantifying the shape of voids is non-trivial because 
the agglomeration of empty spheres yields voids of com- 
plex shapes. This problem is much like the one of quan- 
tifying the shape of clusters of astronomical objects. 

We have used the method of the best fit ellipsoid simi- 
lar to that used by Jang-Condell & Hernquist (2001) and 
Platen et al. (2007). We use Monte Carlo methods to 
compute the shape tensor S: 

N 

Sij = ^ rukrkirkj (6) 

k=l 

where is the distance of the fc*'' particle to the center 
of the void, i and j represent the spatial components 
and we assume that = 1 for all k randomly placed 
particles inside the void. We then find the eigenvalues of 
5S/M (where M — J2k=i which are the square of 
the sizes of the principal axes of the best fit ellipsoid (a^, 
b"^ and c^). 

We also compute ei, 62 and the triaxiality T given by 
the following equations: 





b 


: 1 - 






a 




c 


: 1 - 




6' 








c2 



Thus, a prolate void in redshift space has ei > 0, €2 = 
and T — 1. An oblate one yields ei = 0, 62 > and 
T = 0. Finally, ei = 62 = corresponds to a perfectly 
spherical void. 

2.7.2. Orientation of Voids in Redshift Space 

The eigenvectors of 5S/M are along the principal axis 
of the best fit ellipsoid corresponding to the respective 
eigenvalues found above. Having the vector along the 
semi-major axis enables us to compute the orientation 
(i.e., (p) of the void. The orientation of a cosmological 
void is defined in such a way that </> is 0° when the semi- 
major axis is aligned with the line-of-sight and = 90° 
when the semi-major axis is perpendicular to the line-of- 
sight. 

3. REPRODUCIBILITY: THE UPDATED ZWICKY 
CATALOG (UZC) 

The UZC is made up of data from both the original 
Zwicky Catalog (ZC) and the CfA redshift survey to 
mzwicky ^ 15.5. At the time that the CfA survey was 
carried out, measuring a redshift was a tedious process. 
Therefore, redshifts from the previous literature were 
used and this led to a very inhomogeneous database. The 
UZC was an attempt at improving the consistency of this 
database and provides a revised 2" accuracy set of co- 
ordinates for the objects in the ZC. The catalog contains 
19,369 objects, 18,633 of which have measured redshifts 

for the main survey regions of 20^ ^ 0^1950 ^ and 

8^ s$ ai950 17^\ and both with -2.5° s$ (^1950 < 50°. 
For more details concerning the UZC, refer to Falco et 
al. (1999). 
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TABLE 1 

Voids in the UZC according to Hoyle & Vogeley (2002). 



HV Diameter^ Volume Distance a 5 

# (/i-iMpc) (h-^Mpc^) (h-iMpc) (deg) (deg) 



1 


39.73 


32828 


51.22 


197.8 


73. 


2 


36.12 


24667 


42.09 


247.2 


39. 


3 


33.79 


20204 


49.81 


52.1 


15. 


4 


32.22 


17506 


59.56 


209.7 


52. 


5 


33.92 


20443 


44.54 


168.0 


38. 


6 


29.25 


13104 


49.15 


254.4 


13. 


7 


28.20 


11747 


59.68 


332.5 


21. 


8 


27.63 


11045 


47.74 


196.6 


12. 


9 


26.75 


10025 


38.91 


334.5 


18. 


10 


29.63 


13625 


61.75 


136.9 


48. 


11 


26.29 


9509 


45.36 


162.6 


14. 


12 


23.89 


7142 


61.65 


3.7 


42. 


13 


24.15 


7378 


63.02 


32.5 


19. 


14 


25.54 


8723 


60.93 


256.2 


41. 


15 


26.23 


9454 


51.65 


277.2 


57. 


16 


26.78 


10060 


56.82 


139.4 


65. 


17 


23.72 


6990 


62.58 


212.2 


26. 


18 


21.07 


4901 


30.35 


48.6 


21. 


19 


26.91 


10203 


51.07 


141.2 


25. 



^ Equivalent spherical diameter. 



For this analysis^, we used the same sample as H&V; 
namely, a volume and absolute magnitude limited sample 
with Zjnax — 0.025 and — —18.96. This sample 

contained 3500 galaxies of which 283 were field galaxies. 
The absolute magnitude M was computed according to 
the formula provided in H&V: 

M = mz^^cky - 25 - 5 log [D, (1 + z)] - 3z. (10) 

H&V's analysis of the distribution of galaxies in the 
UZC revealed 19 voids. Our algorithm found 21 voids. 
Figure [T] shows the position of the centers of our voids 
(red circles) and those of H&V (yellow triangles). Also, 
in Tables [T] and [2l respectively, the results from H&V's 
analysis and the present one are tabulated for comparison 
purposes. 

The position of the center of the voids coincides at least 
50% of the time. Indeed, the two analyses are not ex- 
haustive and should be viewed as complementary rather 
than contradictory. However, the sizes of voids tend to 
be different. Consider for example void FNl which can 
be positively identified with H&V's second void (HV2). 
In both cases, it is amongst the largest voids; however, 
HV2 has a radius 64.8% that of FNl. We conclude that 
FNl is in fact an amalgamation of HV2, HV5 and the re- 
gion around Xc = -10 /i~^Mpc, Yc = -25 h'^Mpc, and 
Zc — 15 h^^Mpc^ (see Figure [1}. This discrepancy in 
the volume is probably due to the intrinsically different 
definitions as to what constitutes a void. 

Another interesting void to consider is HV17, which 
has a radius that is 86.5% that of FN20 (it contains 2 
field galaxies only). In this case, the two results are very 
similar. Once more, our analysis tends to produce larger 
voids because, in general, we add more holes together. 
By visual inspection, one can see that both void sizes 

^ We also carried the analysis on the Point Source Catalog red- 
shift survey (PSCz). However, we chose not to include it in this 
paper as it does not contain any significantly new information. 

^ {Xc,Yc,Zc) are the cartesian comoving coordinates with origin 
coinciding with that of equatorial coordinates. 



TABLE 2 

Voids in the UZC found using the same selection criteria 
AS H&V. 



FN Diameter^ Volume Distance a 5 

# (/i-iMpc) (/i-^Mpc^) (/i-iMpc) (deg) (deg) 

1 55.70 90482 40.1 259.15 31.61 

2 50.74 68399 45.5 60.61 18.89 

3 46.84 53808 56.1 293.07 66.03 

4 46.24 51767 55.3 331.00 18.71 

5 41.08 36299 63.1 218.93 49.87 

6 37.78 28235 64.7 0.09 42.57 

7 37.42 27435 44.3 196.21 14.75 

8 36.66 25797 49.8 187.95 71.34 

9 36.28 25003 60.9 125.50 9.85 

10 35.98 24388 45.4 165.81 37.24 

11 35.78 23984 61.6 138.80 49.78 

12 34.54 21576 35.5 12.77 26.27 

13 33.46 19614 64.6 28.52 21.18 

14 31.08 15720 63.8 91.29 62.71 

15 29.90 13996 63.3 306.32 9.93 

16 28.52 12146 64.3 242.21 11.97 

17 28.08 11593 47.0 162.45 9.09 

18 27.42 10794 65.5 211.92 25.96 

19 25.86 9055 51.9 225.87 7.36 

20 25.68 8867 44.3 102.61 50.52 

21 25.56 8743 62.4 105.34 31.57 

^ Equivalent spherical diameter. 

are possible. It should be noted that our voids are not 
always consistently larger than those of H&V. 

4. ROBUSTNESS OF THE ALGORITHM 

There are several parameters used in the void finder 
algorithm that infiuence the number of voids found. Ide- 
ally, a void finding algorithm should find a consistent 
number of voids regardless of the values of the input 
parameters. To determine which values of the primary 
parameters (A, ^) should be used as the default val- 
ues for subsequent void identifications, several runs were 
made using the UZC for which the "secondary parame- 
ters" were varied. The reason these two parameters are 
grouped together for this analysis is because, as opposed 
to any of the other parameters, their influence on the 
number of voids identified is closely interrelated. This 
was only found after considerable numerical experimen- 
tation. 

The "effective" secondary parameters, referred to 
above, are: (1) the width of the disk of the Milky Way 
(MW width) given in degrees of galactic latitude; (2) the 
extension to the maximum radial boundary of the survey 
(Extension) given in /i~^Mpc; and, (3) which deter- 
mines how holes are merged to form voids. The param- 
eter P2 and (3^ are not included in this analysis as their 
value does not infiuence the number of voids found. To 
compute the robustness of a particular combination (A, 
^), the standard deviation for each permutation of the 
other parameters was calculated and it was normalized 
to the average number of voids (a/fi). This parameter 
is referred to as the normalized variability. As can be 
seen in Figure O the combination of A and ^ that led to 
the most robust results (i.e., the smallest variability) was 
chosen as the default (i.e., A — 2.0, ^ = 12/i^^Mpc). 

We find that the mean number of voids for each pair 
of parameters (A, ^) varies from between 10 to 28 voids. 
This is a significant difference and is one of the weak- 
nesses of our automatic void finding algorithm. For the 
optimum choice of parameters, the mean number of voids 



55.70 


90482 


40.1 


259.15 


31.61 


50.74 


68399 


45.5 


60.61 


18.89 


46.84 


53808 


56.1 


293.07 


66.03 


46.24 


51767 


55.3 


331.00 


18.71 


41.08 


36299 


63.1 


218.93 


49.87 


37.78 


28235 


64.7 


0.09 


42.57 


37.42 


27435 


44.3 


196.21 


14.75 


36.66 


25797 


49.8 


187.95 


71.34 


36.28 


25003 


60.9 


125.50 


9.85 


35.98 


24388 


45.4 


165.81 


37.24 


35.78 


23984 


61.6 


138.80 


49.78 


34.54 


21576 


35.5 


12.77 


26.27 


33.46 


19614 


64.6 


28.52 


21.18 


31.08 


15720 


63.8 


91.29 


62.71 


29.90 


13996 


63.3 


306.32 


9.93 


28.52 


12146 


64.3 


242.21 


11.97 


28.08 


11593 


47.0 


162.45 


9.09 


27.42 


10794 


65.5 


211.92 


25.96 


25.86 


9055 


51.9 


225.87 


7.36 


25.68 


8867 


44.3 


102.61 


50.52 


25.56 


8743 


62.4 


105.34 


31.57 
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Fig. 1. — Voids in the UZC as found in our analysis and that of H&V. The red circles and yellow triangles represent the center of the 
voids in the former and in the latter analyses, respectively. The black dots represent wall galaxies and the hollow diamonds represent field 
galaxies. The volume of the survey has been divided into six slices. The comoving Cartesian coordinates are shown and the limits on Zc 
for each slice are also noted. 



found is 12. Although we could have chosen parameters 
which would give us a number of voids more nearly equal 
to the mean number (all combinations being averaged), 
we believe that it is more important to balance this with 
minimal variability. It is worthwhile noting that the ab- 
solute number of voids in any sample is relatively uninter- 
esting because it cannot be rigorously defined. However, 
what is important is that an algorithm be robust (e.g., 
self-consistent, resistant to variability) so that the rela- 
tive differences in void properties from sample to sample 
can be reliably determined. 

5. STATISTICAL SIGNIFICANCE OF VOIDS 

The statistical significance of void detection was tested 
using resampling techniques similar to those used by 
Kauffmann & Fairall (1991). This method consists in 
first finding the number of voids in the original data, 
and then redistributing the data randomly and finding 
how often a void is detected as a result of statistical 
fluctuations. This randomization experiment is repeated 
many times. This test for the statistical significance of 
the voids was done for the UZC using the parameters 
from H&V's analysis. It was found that 98% of the 
time no voids were identified in the randomized data. 
For our own choice of default parameters, we found that 
more than 99.5% of the time no voids were identified. 
Analogous to the methods used in parametric statistics, 
we can conclude that the probability of committing a 
pseudo-Type I error (i.e., claiming to find cosmological 
voids when in fact the void identification is due to a ran- 
dom fluctuation or a deficiency in the algorithm) is 0.5%. 
The equivalent "Type II error" (i.e., finding no voids in 
a given volume when in fact voids exist) is very difficult, 
if not impossible, to quantify. 
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Fig. 2. — Summary of the variability analysis. The set of primary 
parameters which yields the smallest variability is A = 2.0, ^ = 
12h-iMpc. 

Based on our randomization experiments, we conclude 
that our void finding algorithm is conservative and thus 
voids identified by the algorithm are likely to be signif- 
icant. This is a very important and stringent test since 
meaningful cosmological inferences can only be made if 
the void identification is truly reliable. 
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6. THE SLOAN DIGITAL SKY SURVEY (SDSS) 

The Sloan Digital Sky Survey is the largest astronom- 
ical survey to date. Upon completion, it will have sys- 
tematically surveyed over one quarter of the sky. The 
photometric and spectroscopic data is being released pe- 
riodically. Data Release Five (DR5; Adelman-McCarthy 
et al. 2007) has been available for download since June 
2006. It contains over one million spectra, which are 
critical to the study of large-scale structures. 

The SDSS database offers several possibilities as to the 
type of magnitude system available and we investigated 
most of them. The best choice was the Petrosian mag- 
nitude system and we adopted it. Petrosian magnitudes 
are measured using an adjustable aperture which corrects 
for the fact that more distant galaxies appear smaller in 
size. For this reason, Petrosian magnitudes are a better 
measure of the true apparent brightness of a galaxy (Pet- 
rosian 1976). In other words, the Petrosian magnitude 
of a galaxy will be approximately the same regardless of 
its distance from us. 

The SDSS data is measured through five filters (i.e., 
M, g, r, i, z). We used absolute magnitudes in the r- 
band because we found that the measured if -correction 
showed less spread than in the other filters (see Figure 
15 of Blanton & Roweis 2007). 

We have included reddening and an effective K- 
corrections using the following formula: 

Mr^mr + b~b log(i:»c(l + z)) - Ar~ K (z) (11) 

where nir is the apparent Petrosian magnitude in the 
r-band, z is the redshift, Dc is the comoving distance 
given by Equation (jH) and thus Dc{l + z) is the luminos- 
ity distance, and Ar is the Galactic extinction in mag- 
nitude units. Precise JT-corrections are complicated to 
compute, thus we have used an effective if-correction of 
K{z) « 1.05z after inspection of figure 15 of Blanton & 
Roweis (2007). There is some scatter around this line 
because if-corrections depend on the ensemble averages 
of the spectral energy distribution (SED) of each indi- 
vidual galaxy. Nonetheless, we believe that this is still 
a reasonably good approximation for the redshift range 
for which we are applying the correction. Indeed, the 
size of the scatter is comparable to the uncertainty in 
the magnitude for low redshifts. Moreover, using the 
words of Blanton et al. (2003), "if -corrections are in- 
herently uncertain" even when "properly" computed. In 
fact, we have redone the statistical analyses (described in 
the next section) with no if -corrections and found that 
the conclusions were unaffected. 

We selected an absolute magnitude and volume limited 
sample. We used the data located in the volume 140° < 
a < 230° and 30° < (5 < 65° (2286 square degrees) up to 
a redshift of 0.16 with absolute magnitude in the r-band 
in the interval —22.3 < Mr < —20.8. There are a few 
small unsampled regions or holes in the data and they 
are treated in the same way as the outer boundaries. 

We found a total of 232 voids in the sam- 
pled volume. A void catalog describing the prop- 
erties of these v oids can be fo und at the URL 
http://physics.ubishops.ca/sdssvoidsl T^^^ cata- 
log is a compendium of information containing an up 
to date list of the properties of each void such as their 
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Fig. 3. — Distribution of the mean nearest-neighbor distance as 
a function of redshift for the selected sample. The distribution is 
overall uniform with a mean nearest-neighbor distance of fx 3.0 
h-l Mpc. 

position, the size of the axes of the best-fit ellipsoid and 
the equivalent spherical radii. It should be noted that 
we decided to discuss the properties of only those voids 
whose center is located at a redshift smaller than 0.16 
because of the rapid decrease in the number density of 
galaxies (especially beyond z = 0.2). 

7. VOID PROPERTIES 

In order to interpret the properties of the cosmologi- 
cal voids in our sample, the results must be contrasted 
with theoretical predictions. Large N-body simulations 
are typically generated in order to understand the for- 
mation and evolution of structure in the universe under 
a given paradigm. According to the ACDM model, voids 
in dark matter halo distributions grow outward from neg- 
ative density perturbations and tend toward higher and 
higher sphericity until their boundary reaches that of a 
neighboring growing underdensity; this moment is known 
as shell-crossing. If a small underdensity is embedded 
inside a dense region, it collapses and disappears. After- 
ward, voids either expand with the Hubble flow or merge 
to form even larger voids (for more details on the theoret- 
ical evolution of voids, see Bond & Jaffe 1999; Colberg et 
al. 2005; Dubinski et al. 1993; Sheth & van de Weygaert 
2004). Thus, assuming that galaxies are reasonable trac- 
ers of the underlying dark matter distribution, voids in 
comoving coordinates should either grow or remain ap- 
proximately constant in size with time (i.e., decreasing 
redshift) on average as they expand and merge. Given 
the tremendous wealth of observational data, it is worth- 
while to investigate whether or not the theoretical predic- 
tions for the distribution and evolution of voids in dark 
matter distributions can be confirmed by the currently 
available observational data of galaxy distributions. 

7.1. Void Sizes 

We have attempted to study the variation in void sizes 
as a function of redshift in order to probe void evolu- 
tion. Our range of redshift (i.e. z < 0.16) corresponds to 
a look-back time of approximately t^ = 1.36/i~^ Gyrs, 
which is a relatively small fraction of the history of the 
universe. Thus we do not expect to see significant evo- 
lution over this redshift range. Nevertheless, this is an 
interesting pilot study which can easily be followed up 
as the next generation of large redshift surveys, probing 
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Fig. 4. — The equivalent (grey points) and average equivalent 
comoving radius of voids (black points) as a function of redshift in 
the SDSS. The error bars show the standard error on the average. 
The slope of the best fit line (solid line) to the average equivalent 
radii for z < 0.04 is consistent with zero. The data to the left of 
the vertical dashed line have been removed from this analysis in 
order to reduce edge effects due to the size and shape of the volume 
and errors due to cosmic variance. 

larger volumes and higher redshifts, become available. 
The main confounding factor in studying the change in 
sizes of voids with redshift is due to variations in the 
mean nearest-neighbor distance ((r„„))^ as a function 
of redshift (we are assuming a sufficiently high-density, 
volume-limited sample). We have examined this issue 
by arbitrarily removing galaxies from samples and we 
conclude that the sizes of voids increase linearly with in- 
creasing (r„n). Indeed, past analyses have shown this 
strong dependency of the sizes of voids on the density of 
galaxies (MuUer et al. 2000; MuUer & Maulbetsch 2004). 

With respect to our sample (see Figure[3]), (r„„) is rea- 
sonably constant. For this reason, we are confident that 
any observed change in the average sizes of cosmological 
voids over the entire range of redshifts is independent of 
{Tnn)- We find a roughly constant void volume filling 
fraction of 81% for 0.04 <z< 0.16. 

We group voids in redshift bins of Az = 0.02 and ob- 
tain the average size of voids for each bin. We decide 
to exclude the data below z = 0.04 from our analysis 
to minimize possible edge effects due to the shape and 
size of the sampled volume and the effects of cosmic vari- 
ance. We use a weighted least squares analysis to apply a 
weight to each point according to its standard error and 
find that the rate of increase in the average size of voids 
as a function of redshift is m = —7 ± 20/i~^Mpc per unit 
redshift (see Figure[4|). Thus, the slope is consistent with 
being null or negative as predicted by theory. 

Therefore, while there is a hint that voids may be in- 
creasing in size with time (i.e. decreasing redshift) over 
the limited redshift range probed, the data is statistically 
consistent with no size evolution. Our sample encom- 
passes 40% of DR5 or 5% of the whole sky, therefore it 
might suffer from cosmic variance especially at low red- 
shit. For this reason, this study could benefit from a 
larger sample, either covering a more expansive propor- 
tion of the sky or various directions. 

We define (r„„) = X]"=i ('"i) / (1) where n is the number of 
galaxies in a given bin and is the distance to the nearest-neighbor 
for the i*'' galaxy. Keeping (r„„) constant ensures a homogeneous 
sampling even if structures such as large voids change the overall 
number density in a bin. 



7.2. Shape and Orientation 

We have also analyzed the distribution of the orienta- 
tion and shape of our identified cosmological voids. Per- 
haps not surprisingly, we find a slight majority of prolate 
voids {Taverage = 0.67 ±0.09) in agreement with the the- 
oretical models of Platen et al. (2007). Figure [5] shows 
the bivariate distribution of ei versus £2- 

Following the suggestion of Ryden & Mellot (1996, see 
Section [T]) , we quantify the orientation of cosmological 
voids with respect to the linc-of-sight as a measure of 
the effects of redshift-space distortions on our identified 
cosmological voids. We find an average orientation with 
respect to the line-of-sight of {(p) = 57.9°. In order to de- 
termine whether or not the distribution of our measured 
values of cj) is consistent with being drawn from a ran- 
dom distribution, we used a bootstrap resampling test. 
We took 100,000 samples of 222 voids from our sample 
for 0.04 < z < 0.16 by sampling with replacement. The 
99% confidence interval on the average orientation was: 
54.6° < {(j)) < 61.2°. A similar analysis was performed 
for the median (cj)) of the population. The 99% confi- 
dence interval on the median was: 53.6° < cf) < 65.5°. 
This is consistent with an isotropic distribution of ori- 
entations for which we would expect the (p to be 60° 
and {(j)) to be 57.3°. We also looked at the orientation 
of voids in redshift slices of thickness Az = 0.01. We 
conclude that the scatter in each slice is consistent with 
random statistical fluctuations in the orientations within 
each slice. However, there is the suggestion that redshift- 
space distortions may be causing the voids at redshifts 
of approximately 0.10 (on average lower values of 0) to 
have increasingly higher values of as z — > 0.16. Indeed, 
it can be seen in Figure [6] that there are relatively fewer 
voids at high redshifts with low values of (j) compared to 
z — 0.1. This result is sufficiently interesting that larger 
portions of the sky need to be analyzed before a definitive 
statement can be made. 

There are several caveats that should be noted with re- 
spect to the analysis of the orientations of voids and their 
connection to redshift-space distortions. First, as with 
the above radius analysis, this result may be affected by 
cosmic variance, especially at low redshift. Second, us- 
ing N-body simulations. Platen et al. (2008) have shown 
that the ACDM model predicts that the orientation of 
voids in the distribution of dark matter haloes can be 
correlated for scales < 30h~^Mpc. On the other hand, 
they also find that this predicted void alignment becomes 
less significant for larger scales. They demonstrate that 
the alignment is likely related to the cosmic tidal field. 
While our sample reaches well beyond this scale, we can- 
not rule out the effects of tidal alignment on our results. 

7.3. Comparison to ACDM 

In order to test whether or not our results are con- 
sistent with the predictions of the ACDM model, we 
perform an identical analysis on a mock SDSS catalog 
lightcone of Croton et al. (2006). The mock catalog is 
obtained from the Millennium Simulation via the use of 
the semi-analytic model of Croton et al. (2006) with the 
dust prescription of Kitzbichler & White (2007). We use 
the "observed" redshift (i.e., including pecuhar veloci- 
ties) in order to directly compare with our observational 
analysis of SDSS. We find that the average distance to 
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Fig. 5. — The distribution of the shapes of the voids in our sam- 
ple. A slight majority of voids are prolate in shape. 
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Fig. 6. — Orientation of cosmological voids as a function of red- 
shift. There is no significant trend with redshift. The average 
orientation is <^ = 57.9°. 

the nearest neighbor (i.e. the equivalent of Figure [3]) is 
also constant, albeit at roughly 2.5/i~^ Mpc, for z < 0.14. 
Moreover, the magnitude limited luminosity function is 
in good agreement with the SDSS dataset for redshifts of 
0.06 < z < 0.12. This ensures that the results obtained 
from running our void finding algorithm on the mock 
catalog are comparable to those of the SDSS catalog. 

While the redshift range is slightly smaller than for 
the SDSS survey, we mostly find reasonable agreement 
between the theoretical and observational analyses. In- 
deed, the equivalent comoving radii of cosmological voids 
agree well (see Figure [7]). 

According to our definition of voids, the void volume 
filling fraction for both the mock catalog and the SDSS 
sample for the same redshift range is approximately 80%. 
Specifically, we find a void volume filling fraction to be 
77% for the mock catalog and 83% for the SDSS sample 
(0.06 < z < 0.12). We also find a slight majority of 
prolate voids with an average triaxiality parameter of 
Taverage — 0.65 ± 0.09 in good agreement with the SDSS 
results. This can be seen in Figure [8] where we show the 
bivariate distribution of 62 versus ei. 

In contrast to the SDSS void sample, the 99% confi- 
dence interval for the average orientation of cosmological 
voids in the mock catalog is 61.3° < {(/)) < 69.3°. The 
results strongly indicate (with 99.99% confidence) that 
the voids in the mock catalog are on average compressed 
along the line-of-sight (i.e., > 60°). A similar con- 



50 - 



40 



30 



...f 



0.06 



0.08 0.1 
Redshift (z) 



0.12 



Fig. 7. — The equivalent (grey points) and average equivalent 
comoving radius of voids (black points) as a function of redshift in 
the mock SDSS catalog. The error bars show the standard error 
on the average. 
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Fig. 8. — The distribution of the shapes of the voids in our mock 
galaxy sample. As with the SDSS catalog, a slight majority of 
voids are prolate in shape. 

elusion that cf) is significantly greater than the value for 
random orientations (i.e., 60°) is also evident. 

In order to quantify whether the void orientations from 
the mock catalog are different from those of our SDSS 
sample over the same redshift range, we use a bootstrap- 
ping method to evaluate the probability that the two 
samples are derived from the same population. We find 
that the probability that the two samples have the same 
values of {(p) is less than 0.2% and that they have the 

same values of (j) is less than 0.1%. Provided that the 
two samples are indeed comparable, this discrepancy be- 
tween our analysis of the mock galaxy catalog and that 
of SDSS could be reconciled through fine-tuning of the 
initial conditions of the assumed cosmology or of the in- 
put semi-analytic prescriptions. However, the details of 
the required fine-tuning are non-trivial and beyond the 
scope of this paper. 

8. CONCLUSIONS 

Cosmological voids have only been rigorously studied 
for the past 20 years. Much effort has been put into 
trying to find out if cosmological parameters can be ex- 
tracted from the study of large-scale structures. Our 
primary focus is on the quantification of the properties 
of cosmological voids. In order to accomplish this, we 
look at several void finding algorithms and choose to im- 
plement an algorithm similar to that of H&V. Our algo- 
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rithm is designed to objectively identify voids in a galaxy 
distribution in a manner that would best reproduce the 
results from a visual inspection. 

The void finding algorithm is thoroughly tested using 
the UZC and the PSCz survey, and our results are com- 
pared with those of H&V. In general, good agreement is 
found between the two methods. Our algorithm proves 
to be very conservative which guarantees with a high 
probability that the identified cosmological voids are not 
spurious. 

Using the results from DR5 of the SDSS, we carefully 
identify regions where the survey data is nearly spatially 
contiguous and use that data to identify voids. Our 
galaxy sample is selected in the region corresponding to 
140° < a < 230° and 30° < S < 65° up to a redshift of 
z = 0.16. We then fix the absolute magnitude limits to 
obtain a homogeneously sampled dataset of galaxies. 

We repeat our analysis on a mock SDSS catalog based 
on the semi-analytic models of Croton et al. (2006). The 
magnitude limited luminosity function and the average 
distance to the nearest neighbor in the galaxy sample de- 
rived from the mock catalog show good agreement with 
our SDSS sample for redshifts 0.06 < z < 0.12. This 
guarantees that the results from both analyses are com- 
parable. 

The observational (SDSS) and theoretical (mock cat- 
alog) analyses yield similar void sizes and void volume 
filling fractions. We find no statistically significant trend 
between the average size of cosmological voids and red- 
shift for 0.04 < z < 0.16 in the SDSS sample. We obtain 
a slight majority of prolate voids in both the observa- 
tional and theoretical dataset. 

Following the suggestion of Ryden & Mellot (1996), we 
study the distribution of the orientation of cosmological 
voids with respect to the line-of-sight in redshift space as 
a probe of redshift-space distortions. We find that the 
orientations of cosmological voids in the SDSS sample 
with respect to the line-of-sight are consistent with be- 
ing randomly distributed, indicating that redshift-space 
distortions are either not present or too weak to be de- 
tected with the present analysis. Indeed, both the me- 
dian and average orientation of cosmological voids in the 
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SDSS are consistent with being drawn from an isotropic 
distribution. 

In contrast, the orientation of voids in the mock cata- 
log tend to be compressed along the line-of-sight. Using 
bootstrapping methods, we find a 0.1-0.2% probability 
that the distribution of the orientation of cosmological 
voids over a similar redshift range in both samples are 
consistent with having been drawn from the same par- 
ent population. Assuming that this difference in the 
alignment of voids with respect to the line-of-sight be- 
tween the mock and the SDSS data is real and due to 
redshift-space distortions, which are influenced by the 
initial conditions of the universe in a non-trivial man- 
ner, studies such as ours could in principle help better 
constrain the cosmological parameters and semi-analytic 
models. While beyond the scope of the present project, 
the same analysis could be performed on mock catalogs 
from cosmological simulations of varying initial condi- 
tions and input semi-analytic model prescriptions. 

Finally, given the existence of other void finding algo- 
rithms that use a wide variety of techniques to identify 
voids (see Colberg et al. 2008), this analysis could be 
repeated using a totally different set of criteria for void 
identification and thus the validity of the measured prop- 
erties can be independently verified. This would help 
constrain the possibility of systematics introduced by our 
void finding method. 
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